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Abstract. The most fundamental premise to the standard model of the universe 
states that the large-scale properties of the universe are the same in all directions 
and at all comoving positions. Demonstrating this hypothesis has proven to be a 
formidable challenge. The cross-over scale Ri SO above which the galaxy distribution 
becomes statistically isotropic is vaguely defined and poorly (if not at all) quantified. 
Here we report on a formalism that allows us to provide an unambiguous operational 
definition and an estimate of Ri SO - We apply the method to galaxies in the Sloan 
Digital Sky Survey (SDSS) Data Release 7, finding that Ri SO ~ 150/i _1 Mpc. Besides 
providing a consistency test of the Copernican principle, this result is in agreement 
with predictions based on numerical simulations of the spatial distribution of galaxies 
in cold dark matter dominated cosmological models. 
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1. Introduction 

The cosmological principle (CP), the assertion that the cosmic mass distribution appears 
homogeneous and isotropic, that is uniform, to a family of typical observers that move 
with the same average velocity of the surrounding matter (fundamental or comoving 
observers) has far reaching consequences in cosmology [1]. It entails that the geometry 
of space-time be highly symmetric and completely described by the simple Robertson- 
Walker metric [2j [3]. Furthermore, it implies that space expands at a rate that is set by 
the equations of Friedmann & Lemaitre [U |5]. 

The very first surveys of the three-dimensional distribution of optical galaxies 
showed that the topology of the large-scale structure is very complex and irregular 
[6J [7J . Because of this departure from exact uniformity, the CP is regarded as a coarse- 
grained model of the universe, a statistical description of the mass distribution that 
applies only on sufficiently large scales where the finest details of the galaxy clustering 
pattern become irrelevant. 

More recently, two-dimensional observations of the Cosmic Microwave Background 
(CMB) [8] have shown that the universe is extremely isotropic about us (to roughly 1 
part in 10 5 ), confirming earlier claims based on the analysis of the spatial distribution 
of local (z ~ 1) sources (e.g. [9]). What is challenging is to show that the universe 
is isotropic also about distant observers. As difficult as it may seem, it is important 
to attack the problem. Indeed, while isotropy at a specific position does not imply 
cosmic homogeneity (and viceversa), isotropy about every fundamental observer does 
imply overall homogeneity [10]. Lacking direct evidence for everywhere isotropy, the 
case for the CP rests more on philosophical rather than on empirical evidence ( [T2"l IT3] ) ; 
it is enough to postulate that we are not privileged observers (the so called Copernican 
principle) to deduce that if the universe appear isotropic about our position, it must 
also appear isotropic to observers in other galaxies. 

The tremendous explanatory power of the standard model of cosmology cannot 
be advocated as an indirect demonstration of the CP, since the Friedmann-Lemaitre- 
Robertson- Walker (FLRW) models are not the only solutions of the Einstein equations 
which are able to fit cosmological observations. In particular, many authors have 
speculated that some effects of the accelerated expansion of the universe [HI [15j [161 El 
IT8] . which remains fundamentally unexplained in terms of microscopic physics, could 
be mimicked by allowing violations of the CP (e.g. [20l fl9] ). This intriguing possibility 
has motivated recent attempts of rooting the CP on a more solid basis. Interestingly, 
there are some encouraging proposals in this direction which are based on the analysis 
of the large-scale maps of CMB anisotropics [211 E21 E31 El] , of galaxies [251 1261 1271 128] 
and of supernovae [29|. 

Even if we postulate the CP, the picture is not complete unless we identify the 
averaging scale that is implicit in this assumption, i.e. the scale on which the FLRW 
model provides an effective, coarse-grained description of the universe [30J. It is 
generically asserted that the CP holds on domains that are large enough to encompass 
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the biggest gravitational structures of the universe. Yet, few studies have attempted to 
narrow in on the length value above which dumpiness gives way to uniformity [3T] . 

Past efforts were mostly based on the analysis of the two-point correlation properties 
of galaxy samples [32j [33]. This approach, however, suffers from severe theoretical 
drawbacks. Since the average number density of the sample is needed as input, the 
method presupposes the premise to be tested, i.e. a constant density distribution of 
matter [31]. Moreover, it does not provide an unambiguous definition of the cross-over 
scale [35]. As a consequence, the inferred homogeneity length-scales depend on the size 
of the analyzed sample and range from values as low as 30/i _1 Mpc up to 200/i~ 1 Mpc 
[361 13T1 f38l 1391 PfOl E] . More recently, orthogonal techniques have been explored which 
are based on the count-in-cells analysis of observations confined to a spatial hyper- 
surface of constant time (e.g. [42J). These methods are insensitive to light cone effects, 

1. e. possible biases arising from comparing galaxy fluctuations at different cosmic epochs 
[43], and seem to indicate a transition to homogeneity at a scale of 70 _1 Mpc [HI H5] 
(but see [H] for an opposite conclusion). In particular, the counting method advocated 
by [H] allows to estimate the homogeneity scale independently from the sample size. 

It is widely believed that, since we cannot point telescopes from any other place but 
the solar system, it is not possible to establish if also distant observers see an isotropic 
universe. While this argument is certainly true for apparent 2D quantities such as, for 
example, the CMB temperature (but see [21]), we show here that it does not apply 
to 3D maps of the spatial distribution of galaxies. Specifically, we quantify the typical 
dimension above which independent observers see an isotropic 'bath' of galaxies. Besides 
establishing an operational definition of the isotropy scale, our approach also provides 
an overall consistency test for the hypothesis that we are not privileged observers of the 
universe. 

2. The Method 

We identify paths of extremal length radiating from a given arbitrary target galaxy to 
every other n th closest neighbour (see Figure 1). The amplitude of the angle t between 
these directions and the observer line-of-sight (los) to the target is computed by assuming 
that the local properties of a homogeneous and isotropic universe are described in terms 
of the infinitesimal Robertson- Walker j2j [3] line element 

ds 2 = (cdt) 2 - a 2 (t)[d X 2 + Z 2 k ( X )(d6 2 + sin 2 6 'dcj) 2 )] 

where, c is the light speed, k is the scalar spatial curvature, x is t ne radial geodesic 
comoving distance, a(t) is the cosmic expansion factor, and where, using the Kronecker 
symbol, £&(%) = <5fc,i sin x + ^k,oX + ^fc,-i sinh X- We compute the length of the geodesic 
XAn between the target galaxy A and its n th closest neighbour by exploiting the fact that 
the spatial part of the cosmic metric is invariant under a quasi-translation transformation 
of its coordinates pQ. We can thus translate the reference frame from the terrestrial 
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Figure 1. Upper We determine the geodesic connections between a given target galaxy 
A and all the surrounding galaxies that lie inside a sphere of radius R centered on A. 
The target galaxy A and its n th closest neighbor subtend an angle A9 at the observer 
position O. The tilting angle t measures the inclination of the geodesic separation XAn 
between A and n with respect to the observer line-of-sight to A (dashed line) . If the CP 
holds on the scale R we expect this 'spaghetti ' to be isotropically oriented about any 
given target. In other terms we expect the los angle t to be isotropically distributed, 
i.e. its PDF is ip(t) — (sini)/2. Lower: the distribution of sin 2 t as a function of the 
geodesic separation \An in a sphere of radius R = 300/i _1 Mpc randomly positioned in 
the LRG SDSS sample. The dotted line represents the theoretically expected average 
(/j = 2/3), while the solid line represents the IGI value, that is the average of the 
plotted points. The shaded area represents the la uncertainty of the IGI value. 
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By orienting the axes in such a way to minimize the number of non-zero components 
(we choose x n/A = (X fe (xAn) , 0, 0), x n/0 = (X>k(Xn) sin A9 , 0, S fc (x„) cos A9) and 
^a/o = (0 , 0, — Efe(x^))), and by exploiting the identity 

Ct(x) + fcEfo) = 1 (2) 

we obtain 



Hl(XAn) = Zl(xn)sm 2 A9 + 

+ T> k {Xn)C k {xA) cos A9 - Efc(xA)Cfc(Xn) 
Consider now the generalized law of sines 



(3) 
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Since t = tt — T, it finally follows from eqs (131) and (jlj) that 



1 

sin 2 t = tt. (5) 

S fc (XA)C fc (Xn) n " 

Sfc(xn) sinA6» 



1 + 



C fe (xA) COtA^ 



If the CP holds, the los angle t has a comoving space Probability Density Function 
(PDF) of a characteristic type (<p(t) = (sin£)/2), namely, it is a random variable 
isotropically distributed with respect to any fundamental observer. Therefore, the 
expectation value \i = (sin 2 t) is cosmology independent and equal to 2/3. We define the 
indicator of galaxy isotropy (IGI) as the estimator tur constructed by averaging equation 
(JS} over n galaxies inside a sphere of comoving radius R that is centered around any 
given observer in the universe. On scales R where the CP applies, we expect the measure 
of m R to converge to the predicted value fi = 2/3 (see Figure 2). 

The testing protocol is as follows: we assume that the CP holds and we implement 
standard statistical inference methods to try to falsify it and reject its validity. In detail, 
we assume the existence of a length-scale R above which the empirical IGI estimates 
(wir) are statistically identical to the theoretical prediction (//). We thus formulate a null 
hypothesis ho according to which the two quantities are not different. We quantify the 

J A straightforward way to obtain it is by repeatedly applying equation ((3|) to the 3 apexes of the 
triangle (S, T, A9) shown in Figure 1, and by isolating, after some algebra, identical terms in the 
resulting expressions. 
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goodness of the agreement by means of x 2 statistics, and, following standard convention, 
we fix the rejection threshold of h , i.e. the risk of reaching the wrong conclusion, at the 
5% level. This means that the hypothesis that the universe is isotropic above a scale R 
cannot be rejected by data if the probability P of obtaining a worst (larger) x 2 value is 
greater than 5%. On the contrary, an eventual failure in identifying the scale of isotropy 
would unambiguously point at the incoherence of the FLRW model. 

Homogeneity and isotropy are properties that characterize the large-scale 
distribution of matter on a 3D spatial hyper-surface at a given instant of time. Since 
light propagates at a finite speed, the most distant regions of the 3D volume directly 
accessible to observations are also the furthest in time. As a consequence, the number 
density of galaxies, an observable that is modulated by local physical processes with 
their own specific time-scales, is expected to vary as a function of distance. This is a 
known issue that hampers most of the tests of the CP [49J. In the following we show 
that, by focusing our attention on the angular distribution of galaxies, instead of their 
number density fluctuations, we can tackle the past light cone issue. 

If the CP holds true, as we assume here, the galaxy spatial number density 
p s {r) within spherical shells of width Ar centered on the terrestrial observer must be 
independent from the distance r. Note that shell-homogeneity, that is p s = const, 
does not imply homogeneity, i.e. invariance under general spatial translations, while the 
opposite is true. More importantly, the radial constancy of p s does not imply everywhere 
isotropy (isotropy about arbitrary comoving observers) that is the fundamental facet of 
the CP that we want to test. As a matter of fact, the distribution of galaxies that 
surrounds us can be characterized by a constant p s and yet be anisotropic. We therefore 
can legitimately remove past-light cone artifacts, by imposing that the comoving number 
density of galaxies be strictly constant within concentric shells centered on us, without 
assuming homogeneity. In practice, we analyze a volume limited catalog of galaxies, 
that is a sample of objects brigther than a given minimum absolute luminosity, and we 
additionally remove, with a random rejection process, any residual radial gradient in 
the distribution of galaxies. As we show in the Appendix, this technique preserves the 
clustering properties of the galaxy distribution, and does not falsely impose homogeneity 
where there is none. 

3. Data 

We apply the method to the seventh release of the Sloan Digital Sky Survey [50] which is 
comprised of ~ 930, 000 galaxies over a field of view of 9380 deg 2 . Our analysis is limited 
to luminous red galaxies (LRG, [51]) distributed in the North Galactic contiguous area 
defined by 120 < RA < 240, 7 < 5 < 56. A sample with a nearly constant density 
of galaxies is obtained by volume limiting the SDSS dr7 catalog in the redshift range 
0.22 < z < 0.5. This sample encompasses a comoving radial size Ar ~ 700/i -1 Mpc (in 
what follows, we consider a cosmological model characterized by the reduced density 
of matter Q m = 0.27 and dark energy = 0.73, and we assume that the value of 
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Figure 2. The IGI value measured by different observers, labeled by their right 
ascension coordinate, is plotted. Each estimate is performed within non-overlapping 
spheres of comoving radius R (shown in the inset) randomly thrown in the volume 
covered by the SDSS galaxy survey. Errors are computed as the standard deviation of 
the mean, and well trace the theoretically expected figure a = i/4/(45n). The average 
IGI value {m R = (sin 2 t)) is 0.678 ± 0.005, 0.672 ± 0.006, and 0.660 ± 0.007 from left 
to right. The solid line shows the expectation value predicted under the assumption 
that the CP holds (i.e. /i = 2/3). On small scales data scatters widely, while on scales 
where the CP is expected to hold, data fit the theoretical prediction. A goodness of fit 
statistical analysis yields x 2 /dof = (2.1, 1.12,0.65) from the the smaller to the larger 
scale. 



the Hubble constant is Hq = 72 km s^MpcT 1 ). The upper redshift limit is fixed by 
the requirement of measuring the IGI with approximately the same average precision of 
nearly 1% over all the interval 100 < R < 200/?r 1 Mpc. 

The strict equality p s (r) = const is then imposed by interpolating the observed 
number density p s of objects in spherical shells centered on us (and with thickness Ar a 
hundred times smaller than the effective depth of the sample), and by randomly rejecting 
galaxies using a Monte Carlo process with selection function 0(r) = min(p s )/p s (r). 
The final LRG sample contains a total of ~ 6500 objects, has a mean number density 
p = 6.14 ■ 10~ 6 /i 3 Mpc~ 3 and covers an effective field of view of 4860 deg 2 . 

4. Analysis of Data and Comparison to Theoretical Models 

For meaningful error interpretation, it is imperative to acquire independent 
measurements of the IGI value, that is of the observable uir = (sin 2 t) estimated using 
eq. (jSJ). Consequently, we do not apply our scheme to every galaxy in the sample, i.e. 
we do not carve spheres of comoving radius R around each 'extraterrestrial' observer 
to determine whether they see the same degree of isotropy. Instead, we only select 
as observers, those target galaxies that are at the center of non-overlapping spheres. 
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As an example, given the geometry of the largest contiguous volume in the SDSS 
survey, we can place a maximum number N = 107, 30, 9, 4, 3 of independent observers, 
exploring the isotropic distribution of galaxies on length-scales R = 100, 150, 200, 250 
and 300/i -1 Mpc respectively. Each of these observers are geodesically connected, on 
average, to n — 26, 87, 206, 401, 695 galaxies respectively. 

Before analyzing real data, we have first applied our method to synthetic samples 
simulating spatially random (Poissonian) galaxy distributions. The point here is to 
detect the minimum radius Ri SO below which our technique is noise-limited and the 
scale of everywhere isotropy cannot be resolved. Using Monte Carlo techniques, we 
have generated various uniform mock catalogues with galaxy number densities in the 
range 10" 4 — 10 _6 /i 3 Mpc~ 3 . We have found that, as expected, when the scale R is 
larger than the mean inter-particle separation A = p -1 ^ 3 , the distribution of the t angle 
statistically converges towards an isotropic PDF. Quantitatively, as soon as R > 1.5 A, 
that is when on average ~ 4 • (1.5) 3 galaxies are geodesically connected to the observer, 
the risk of reaching the wrong conclusion in rejecting the isotropy hypothesis ho becomes 
larger than 5%. In particular, R iso of a spatially random distribution of galaxies with 
the same density of the LRG sample investigated in this study can be unambiguously 
detected on scales larger than ~ 85/i _1 Mpc. 

We have then analyzed the LRG galaxy sample extracted from the SDSS survey. 
The IGI value estimated by distant observers on a scale R = 100, 150, and 200/i~ 1 Mpc 
is graphically shown in Figure 2. It is interesting to note that, for any displayed scale R, 
the distribution of the average IGI values ((m^)) peaks at \i = 2/3, while the variance 
of the distribution decreases as a function of R. The stability of the central value of the 
distribution shows whether isotropy is present on average, whereas the scatter shows 
whether isotropy is present for all observers. In accordance with standard theoretical 
expectations, as the i?-scale increases, all the observers are equally likely to observe 
isotropy, i.e. they lose their specificity and progressively become the 'typical' observer 
of the universe. The upper panel of Figure 3 confirms that the galaxy pattern observed 
from different positions in the universe approaches an isotropic distribution. 

The precise scale of transition to isotropy Ri SO is quantitatively determined as 
follows. First, by randomly rejecting galaxies from the main LRG sample, we 
have constructed 1000 subsamples that satisfies the requirement p s = const. This 
bootstrapping process, allows us to estimate the central moments and the dispersion of 
the P statistics. We have then positioned the centers so that the maximum number 
of non-overlapping spheres of radius R fit inside the survey volume. In particular, 
we require that the position of the extraterrestrial observers change randomly from 
sample to sample. For each length-scale R probed, we have finally computed the risk of 
erroneously rejecting the null hypothesis as the median of P over the 1000 realizations. 
The lower panel of Figure 3. shows that the median risk is larger than 5% for scales larger 
than \50h~ 1 Mpc. Despite the observed spread in the P-values for large R, essentially 
due to the low density of the LRG sample, a statistically significant sharp transition 
towards isotropy at a scale R iso ~ 150/i -1 is unambiguously detected. Notwithstanding, 
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a larger sample might also definitively exclude the hypothesis that the transition happens 
at a scale as low as 120/i -1 Mpc. In fact, the evidence with which such a low R iso is 
currently excluded is not yet conclusive. 

As a matter of fact, the error with which the probability of such a low transition 
scale is estimated from present data is not tightly constrained. 

Our conclusions are, within error bars, independent of the density cut we artificially 
impose to guarantee shell homogeneity. We have verified that samples that are 
everywhere isotropic on a scale Ri SO continue to be everywhere isotropic on that scale 
if the density threshold imposed by the requirements of shell homogeneity is enhanced. 
This conclusion follows from the analysis of subsamples obtained by volume-limiting the 
SDSS catalogue at a lower redshift z max . Interestingly, and in the opposite sense, if the 
sample is isotropic on a scale Ri SO then it continues to be isotropic on that scale, even 
when its density is artificially lowered (by randomly rejecting galaxy members). The 
isotropic length proves robust up until the investigated scale R becomes smaller than 
~ 1.5 times the mean inter-particle separation. Below that threshold, as the analysis of 
random samples already suggested, the predictive power of our indicator breaks down. 

We have compared our measurements to predictions of iV-body simulations of the 
large-scale structure of the universe. To this end, we have analyzed with the same 
technique 50 independent mock catalogs simulating the distribution of LRG galaxies in 
an SDSS-like survey. They were constructed by the LasDamas project (52] using A cold 
dark matter simulations (with characteristic parameters Qm — 0.25, Q\ = 0.75, h = 
0.7, a 8 = 0.8, n. = l). 

Figure 3 quantifies the confidence level with which the hypothesis h cannot be 
rejected on a scale R, and compares it to what is expected in the mock catalogs. Not 
only is a sharp transition towards isotropy at a scale R ~ 150/i -1 Mpc detected in real 
SDSS data, it is observed in synthetic galaxy catalogs too. This excellent agreement 
implies that the scale of isotropy R ~ is a length that characterizes not only 

luminous galaxies, i.e. the visible component of the universe, but also of the most massive 
dark matter halos. The significance of this conclusion is best understood by considering 
that the everywhere isotropy inferred from real data alone, does not give insight into 
the corresponding arrangement of the underlying mass component. 

5. Conclusion 

An acritical acceptance of the Copernican principle might result in what Haynes [53] 
called the "Verrazzano bias" . As in the case of this explorer who, off the coast of the 
outer banks of North Carolina, mistakenly believed that he had discovered the Pacific 
Ocean, it is dangerous to draw definite cosmological conclusions on the basis of limited 
data collected from a special spatial position. 

In this paper we have presented a new geometrical tool that allows us to assess 
whether or not, from the view point of a distant galaxy, the large-scale structure of the 
universe appears almost identical to its aspect from earth. Virtually all of the previous 



The Scale of Cosmic Isotropy 



10 




50 100 150 200 250 300 350 
R (h-'Mpc) 



Figure 3. Upper: the observed PDF of the los angle t (histogram) is compared to the 
isotropic prediction ((f(t) = (sini)/2) on a scale R = 150/i -1 Mpc. The ratio between 
model deviations and data errors is also plotted (together with the lines indicating 
lcr and 2a deviations). Lower: Dots show the levels of significance at which we fail 
to reject ho on a given scale R. We conventionally assume that the hypothesis of 
everywhere isotropy cannot be rejected when the risk of an erroneus decision is larger 
than 5% (this threshold is represented by the solid blue line). The levels of significance 
are computed as the median of the probability P inferred from 1000 resamplings of 
the SDSS LRG sample that are shell homogeneous. Error bars bracket the first and 
third quartile of the distribution of P at any given scale R. The blue dotted line shows 
the average expectation for P extracted from the analysis of 50 ACDM mock catalogs 
simulating the LRG SDSS sample. The red envelope shows the region bracketing la- 
fluctuations around the average expectation extracted from simulations. 
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attempts to identify the coarse graining scale above which the visible distribution of 
matter comply with the requirements of the CP have focused on the analysis of the 
so called homogeneity scale. In this paper we have addressed this same issue from 
a different angle. We propose to identify this fundamental length with the scale of 
everywhere isotropy Ri SO , the scale above which the distribution of galaxies appears 
isotropic to every comoving observer, that we define as the smallest scale at which the 
probability of wrongly rejecting the CP is smaller than 5%. 

By analyzing state-of-the-art data, we have found that the galaxy distribution, as 
traced by luminous red galaxies, appears isotropic to every comoving observer in the 
universe once the averaging scale is larger than Ri SO ~ 150/i -1 Mpc. This figure is 
in excellent agreement with predictions of the spatial clustering of galaxies in ACDM 
simulations. 

The advantage of the method is that it is insensitive to the shape of the radial 
selection function of the redshift sample analyzed, i.e. to the effective number of objects 
that sample the underlying clustering of galaxies as a function of redshift. As a matter 
of fact, it is straightforward to subtract look-back time issues once the focus is shifted 
from counting objects (the standard methodology of the homogeneity tests) to measuring 
angles (as implemented by our strategy). 

Since the matter distribution converges continuously towards homogene- 
ity/isotropy, it is quite arbitrary to decide which criterium must be adopted to single 
out an exact scale of transition. In this work we adopt the point of view that the most 
natural way to test the CP is to assign a probability to the hypothesis that this model is 
wrong. The goal is to frame the analysis of its coherence within the domain of probabil- 
ity theory, as the intrinsically statistical nature of this cosmological statement explicitly 
demands. This helps elucidating the meaning of such generic sentences as "....the CP 
holds on a scales larger than XXX Mpc" and will ease the quantitative comparison of 
the results obtained with different and independent methods. 

Deeper redshift surveys of the universe (such as, for example, BOSS, BigBOSS 
or EUCLID) are currently ongoing or expected to be soon completed. It would be 
interesting to understand if the scale of everywhere isotropy does scale as a function 
of cosmic time as predicted by numerical simulations of the gravitational clustering in 
the universe. This will confirm that the CP is not some temporary assertion about 
the present day appearance of the universe but a fundamental property of matter 
distribution at all cosmic epochs. Even more importantly, it will help us to shed light on 
the physics behind the large-scale uniformity of the universe by answering the question: 
where does this scale come from? 
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6. Appendix A 

The fair sample model of the universe (Layzer 1956), assumes that the galaxy 
distribution is a discrete stochastic process resulting from the Poissonian sampling 
of an underlying continuous matter density field A(x). Accordingly, a galaxy sample 
that traces an underlying continuous field of PDF Q(A) can be modeled as a discrete 
stochastic process in which the probability of counting A" galaxies within a given 
arbitrary cell is 



N 



P(N\A)Q(A)dA, (6) 



where the conditional sampling probability is 

A-JV 

P(N\A) = — e-\ (7) 

In the ideal universe described by the FLRW model, the continuous matter fluid 
A(x) satisfies exactly the CP and its PDF is the Dirac delta Q(A) = 5 D (A - A ). 
Equations and (|7|) therefore imply that P/v follows a Poissonian statistic, that is the 
spatial distribution of galaxies is random. 

We now demonstrate that a spatially random sample of size K cannot be generated 
by randomly sampling, with probability p, a parent population of size A" whose spatial 
distribution is clustered, that is inhomogeneus. Let the sampling process be described 
by 

K = <&( x i) + $(x 2 ) + +$(x JV ), (8) 

where $ is a random variable (taking on the values or 1) distributed according to the 
Bernoulli probability law (that isp*(l— p) 1- *, where p is the probability that $ = 1 in a 
single trial), and where AMs a non-negative integer- valued random variable distributed 
according to the Poisson distribution of average counts N . 
The probability generating function (PGF) of P/v is 

oo 

= (9) 

i=0 
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and its expression in the case of the Bernoulli and Poisson processes is 

g*(z) = (i-p)+ P z (io) 

and 

g N (z) = e^ R , (11) 

respectively. 

It can be shown [55] that the PGF of the sum of independent, identically distributed 
random variables i.e. the PGF of the random variable K in equation (jHJ) is 

Gk{z) = Gn{Gz{z)). (12) 

Let's assume that Qk is the PGF of a Poissonian distribution with average 
parameter n. By taking the derivative of eq. ( fl2i) we obtain 

- [ (g*ig=a _i]n dg N 

ne l p J = -j^—p (lo) 
which, upon integration gives 

g N {u) = e {u - 1) * +C. (14) 

The value of the arbitrary constant C can be set to using the additional condition 
(?iv(l) = 1 (see eq. Qj). Therefore, a spatially random distribution cannot be the result 
of the random sampling of a non-Poissonian distribution. Reciprocally, one can show 
that a random sampling of a Poisson parent distribution, results in a subsample of 
elements with Poisson distribution. 

We now show that also the Appoint moments of the continuous mass distribution 
A(x) are not modified by a random sampling process. Given the counts iVj = iV(xj) 
in a cell at position Xj, the N point PGF is immediately obtained by generalizing the 
expression given in eq. (J9j) 

G(zi,Z2,....,Z n ) = 2J ••• S ^PN 1 ,N 2 ,...N n Zi 1 Z2 2 ....Z^ n , (15) 

where 

P Nl ,N 2 ,..., Nn = J P(N 1 \A 1 )....P(N n \A n ))Q(A 1 )...Q(A n )dA 1 dA n , (16) 

and where Aj = A(xj). If the sampling is random, the Appoint PGF is given by 

G(z u z 2l z n ) = (fAite-^eM*-!) e ^(z n -i)y (17) 

The moment generating function associated to the discrete, A^-point, counts 

Ni, N 2 , N n follows immediately by substituting Zi — >■ e Zi in the argument of the 
PGF |56] 

M(z u z 2 , z n ) = ( e Me*i-i) e Me**-i)^ _ } gM^-l)). (is) 
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The Appoint moment of the galaxy distribution, is calculated as the functional 
derivative of the characteristic function [57] 

(iV( Xl )iV(x 2 ) N(x n )}= [ M I (19) 

0Z\0Z2 0Z n 21=22 = =2 n =0 

By substituting eq. ( IT8|) into the previous one, we see that the Appoint galaxy 
moment is equivalent to the corresponding statistics computed for the underlying 
continuous matter field, i.e. 

(iV(x 1 )iV(x 2 ) iV(x„)> = (A( Xl )A(x 2 ) A(x n )>. (20) 

As a result, the whole hierarchy of N-point galaxy correlation functions computed 
from a random (discrete) sample trace with fidelity the Appoint correlation function 
of matter. Again, an inhomogeneous spatial distribution cannot be turned into a 
homogeneous one by a random sampling process. 



